Abstract.

We construct an adaptive solution (estimate), asymptotically optimal in order in the $L(2)$ sense, of a linear integral equation of the first kind, and an estimate of the energy of this solution, together with confidence regions that are also adaptive.
1 Statement of problem.

Let us consider the following linear integral equation of the first kind (an ill-posed problem, see [28]) with respect to an unknown Riemann integrable function $f : [0,1] \to (-\infty, \infty)$:

$$\int_0^1 R(t_i - s)\, f(s)\, ds + \sigma \epsilon_i = y(t_i), \tag{1}$$

or equally

$$g(t_i) + \sigma \epsilon_i = y(t_i), \qquad g(t) := R * f(t), \tag{2}$$

with the data $y(t_i)$ observed on the uniform grid $t_i = i/n$, $i = 1, 2, \ldots, n$, with random errors of measurement (noise) $\sigma \epsilon_i$.

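Here $\{\sigma \cdot \epsilon_i\}$ are the errors of the measurements at the points $t_i$, where $\{\epsilon_i\}$ are independent, centered ($\mathbf{E}\epsilon_i = 0$) random variables with $\mathrm{Var}(\epsilon_i) = 1$, defined on some probability space with sample set $\Omega$ and probability measure $\mathbf{P}$, with expectation $\mathbf{E}$ and variance $\mathrm{Var}$; $\sigma = \mathrm{const} \in (0, \infty)$; $R(\cdot)$ is the kernel, which may be a generalized function; the arithmetic operations on the arguments $t, s$ are understood modulo 1 (periodicity).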

A $1/\sqrt{n}$-consistent estimate $\sigma(n)$ of the parameter $\sigma$ is obtained in the article [11] by means of the Residual Sum of Squares (RSS) method; namely,

$$\sigma^2(n) = \sum_{i=1}^{n} \left( y(t_i) - f_0(n, t_i) \right)^2 / (n-1),$$

where $f_0(n, t)$ is some preliminary estimate of the function $f$; for instance, $f_0(n, t)$ may be obtained by means of the Minimum Square Estimation (MSE) method.
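The RSS formula can be sketched in a few lines of Python. This is a minimal illustration, not code from [11]: the function name `rss_variance` and the argument `fitted` (the values of the preliminary estimate at the grid points, exactly as they enter the formula) are assumptions made for the example.

```python
def rss_variance(y, fitted):
    """Residual-sum-of-squares estimate of sigma^2.

    y      -- observations y(t_1), ..., y(t_n)
    fitted -- values of a preliminary estimate at the same points
              (hypothetical input; how it is obtained is not specified here)
    Returns sum_i (y_i - fitted_i)^2 / (n - 1).
    """
    if len(y) != len(fitted):
        raise ValueError("y and fitted must have the same length")
    n = len(y)
    if n < 2:
        raise ValueError("at least two observations are required")
    return sum((yi - fi) ** 2 for yi, fi in zip(y, fitted)) / (n - 1)
```

For example, `rss_variance([1.0, 2.0, 4.0], [1.0, 1.0, 2.0])` sums the squared residuals 0, 1 and 4 and divides by 2.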

A more exact estimate of the error variance $\sigma^2$ is given in the article [16], where the so-called supereffective Generalized Bayes Estimators for $\sigma^2$ are proposed and investigated.

We will therefore suppose for simplicity that the value $\sigma$ is known.

We consider the asymptotic statement of the problem: $n \to \infty$.

Notice that in the case $R(t) = \delta(t)$, where $\delta(t)$ denotes the Dirac delta-function, the problem (1) coincides with the classical regression problem of nonparametric statistics; adaptive algorithms for solving it (estimating the function $f(\cdot)$, in statistical language) that are optimal in the $L_2(\Omega \times [0,1])$ sense are described in many publications; see, for example, the articles [1], [2], [3], [4], [5], [6], [7], [8], [9], [11], [13], [15], [18], [19], [20], [21], [23], [24], [25], the monograph [29] and the references therein.

Our goal is to offer and investigate an adaptive estimate of the unknown function $f(\cdot)$, based on the observations $\{y(t_i),\ i = 1, 2, \ldots, n\}$, which is asymptotically (as $n \to \infty$) optimal in the $L_2(\Omega \times [0,1])$ sense.



2 Denotations. Assumptions. 

1. Let $X = [0,1]$, $t \in X$, and let $T = \{\phi_k(t)\}$ be the classical complete normalized trigonometric system on the set $X$: $\phi_1(t) = 1$, $\phi_2(t) = \sqrt{2}\cos(2\pi t)$, $\phi_3(t) = \sqrt{2}\sin(2\pi t)$, $\phi_4(t) = \sqrt{2}\cos(4\pi t)$, $\phi_5(t) = \sqrt{2}\sin(4\pi t)$, $\phi_6(t) = \sqrt{2}\cos(6\pi t)$, $\phi_7(t) = \sqrt{2}\sin(6\pi t)$, $\ldots$.

Define for the measurable function $g : [0,1] \to R$ from the equality (2)

$$g(t) = \sum_{k=1}^{\infty} c(k)\,\phi_k(t); \tag{3}$$

here

$$c(k) = \int_0^1 g(t)\,\phi_k(t)\, dt$$

are the Fourier coefficients of the function $g(\cdot)$ over the system $T$.
2. Let us write the formal Fourier expansion for the kernel $R(\cdot)$:

$$R(t) = \sum_{k=1}^{\infty} \phi_k(t)/w(k);$$

then formally

$$f(t) = \sum_{k=1}^{\infty} w(k)\, c(k)\, \phi_k(t).$$

We assume

$$\inf_k |w(k)| > 0, \qquad \exists\, \theta = \mathrm{const} \in [0, \infty), \quad |w(k)| \asymp k^{\theta}. \tag{4}$$

It is evident that in the case $\theta = 0$ the sequence $\{|w(k)|\}$ is bounded on both sides:

$$0 < \inf_k |w(k)| \le \sup_k |w(k)| < \infty.$$

Notice that under the conditions (4) and $y(\cdot) \in L(2)$ the limiting equation, obtained as the noise level tends to $0+$ and $n \to \infty$, i.e. the equation

$$R * f(t) = y(t), \qquad t \in (0,1),$$

has a unique a.e. solution $f = f(t)$.
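3. Let us denote

$$S(N) = \sum_{k=1}^{N} w^2(k), \qquad S_2(N) = \sum_{k=N+1}^{2N} \ldots,$$

$$\rho(N) = \sum_{k=N+1}^{\infty} w^2(k)\, c^2(k), \qquad \rho_2(N) = \sum_{k=N+1}^{2N} \ldots,$$

$$A(N,n) = \sigma^2 S(N)/n + \rho(N), \qquad N^+ = N^+(n) = \min\left[ \mathrm{Ent}(n/\log(n+8)),\ \mathrm{Ent}\left((n/\log(n+8))^{\ldots}\right) \right];$$

$\mathrm{Ent}[z]$ denotes here the integer part of the real positive variable $z$;

$$A^*(n) = \min_{N \in [1, N^+]} A(N,n), \qquad N_0 = \mathop{\mathrm{argmin}}_{N \in [1, N^+]} A(N,n).$$

Note that if $f \in B(w)$, then

$$\lim_{N \to \infty} \rho(N) = 0.$$

Therefore

$$\lim_{n \to \infty} A^*(n) = 0,$$

as long as

$$\sigma^2 n^{-1} S\left(\mathrm{Ent}(n^{\ldots})\right) + \rho\left(\mathrm{Ent}(n^{\ldots})\right) \to 0, \quad n \to \infty.$$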

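Assumptions:

$$0 < \gamma_- := \underline{\lim}_{N \to \infty}\, \rho(2N)/\rho(N) \le \overline{\lim}_{N \to \infty}\, \rho(2N)/\rho(N) =: \gamma_+ < 1, \tag{5a}$$

AND WE SUPPOSE FOR THE CONSTRUCTION OF THE CONFIDENCE REGION FOR THE FUNCTION $f$ THAT

$$\exists \lim_{N \to \infty} \rho(2N)/\rho(N) =: \gamma \in (0,1). \tag{5b}$$

Further conditions:

$$\Gamma_- := \underline{\lim}_{N \to \infty}\, S(2N)/S(N) \le \Gamma_+ := \overline{\lim}_{N \to \infty}\, S(2N)/S(N), \qquad \Gamma_-, \Gamma_+ \in (1, \infty), \tag{6a}$$

OR MORE STRICTLY

$$\exists\, \Gamma := \lim_{N \to \infty} S(2N)/S(N), \quad \Gamma \in (1, \infty); \tag{6b}$$

and we denote

$$U = U(\gamma_-, \Gamma_-) = \min\left[ (1 - \gamma_-),\ (\Gamma_- - 1) \right].$$

The condition (6a) is satisfied if $|w(k)| \asymp k^{\theta}$, $k \to \infty$; the condition (6b) is satisfied if $|w(k)| \sim k^{\theta}$, $k \to \infty$.

These conditions are direct analogues of the well-known $\Delta_2$ condition in the theory of Orlicz spaces.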

The conditions (5a) and (6a) are satisfied, for instance, when as $k \to \infty$

$$|c(k)| \asymp k^{-\Delta}, \quad \Delta = \mathrm{const}, \quad \Delta > 2\theta + 1/2. \tag{7}$$

A more general case may be considered analogously, when

$$\inf_k |w(k)| > 0, \quad \exists\, \theta = \mathrm{const} \in [0, \infty), \quad |w(k)| \asymp L_1(k)\, k^{\theta},$$

$$|c(k)| \asymp L_2(k)\, k^{-\Delta}, \quad \Delta = \mathrm{const}, \quad \Delta > 2\theta + 1/2,$$

where $L_1(k)$, $L_2(k)$ are functions slowly varying as $k \to \infty$. For a detailed investigation of such functions see the book [26].

4. We denote by $c(k,n)$ the usual $1/\sqrt{n}$-consistent estimates of the Fourier coefficients $c(k)$ of the function $g(\cdot)$ based on the sample $y(t_i)$, $i = 1, 2, \ldots, n$, namely:

$$c(k,n) := n^{-1} \sum_{i=1}^{n} y(t_i)\, \phi_k(t_i), \tag{8}$$

and define correspondingly

$$\tau(N,n) := \sum_{k=N+1}^{2N} w^2(k)\, c^2(k,n), \qquad \tau^*(n) := \min_{N \in [1, N^+]} \tau(N,n), \tag{9}$$

$$M(n) = \mathop{\mathrm{argmin}}_{N \in [1, N^+]} \tau(N,n). \tag{10}$$

Note that $\tau(N,n)$, $\tau^*(n)$ and $M(n)$ are random variables which depend on the source data $\{y(t_i)\}$.

The estimate $\gamma(n)$ of the parameter $\gamma$, consistent as $n \to \infty$ under condition (5b), is described in [21], chapter 5, section 13. Namely, we define

$$G = G(n) := \mathrm{Ent}\left(\exp\left(\sqrt{\log n}\right)\right);$$

then the value $\gamma$ may be consistently estimated under condition (5b) by means of the statistic $\gamma(n)$ as follows:

$$\gamma(n) = \frac{\tau(\ldots) - 2\tau(2G)}{\tau(2G) - 2\tau(G)}.$$

5. FURTHER, WE SUPPOSE FOR THE CONSTRUCTION OF THE ASYMPTOTIC CONFIDENCE REGION THAT

$$\sup_i \mathbf{E}(\epsilon_i)^{\ldots} < \infty, \tag{11a}$$

AND WE SUPPOSE FOR THE CONSTRUCTION OF THE NON-ASYMPTOTIC CONFIDENCE REGION THAT $\exists\, q, Q = \mathrm{const} > 0$,

$$\sup_i \max\left[ \mathbf{P}(\epsilon_i > u),\ \mathbf{P}(\epsilon_i < -u) \right] \le \exp\left( -(u/Q)^q \right), \quad u > 0. \tag{11b}$$

We will write, as usual, for a given passage to the limit, for instance $n \to \infty$,

$$X(n) \sim Y(n) \iff \lim X(n)/Y(n) = 1,$$

$$X(n) \asymp Y(n) \iff 0 < \inf_n X(n)/Y(n) \le \sup_n X(n)/Y(n) < \infty.$$

All the relations between random variables, for example passages to the limit, are understood with probability one (mod $\mathbf{P}$).
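The quantities of this section can be illustrated by a short Python sketch. It is only an illustration under stated assumptions: the function names (`phi`, `empirical_coefficients`, `tau`, `select_M`) are hypothetical, the weight `w` is passed as a Python function, and the caller is assumed to supply at least `2 * N_plus` empirical coefficients.

```python
import math

def phi(k, t):
    """k-th function (k = 1, 2, ...) of the normalized trigonometric system on [0, 1]."""
    if k == 1:
        return 1.0
    m = k // 2  # frequency: k = 2m is a cosine, k = 2m + 1 is a sine
    angle = 2.0 * math.pi * m * t
    return math.sqrt(2.0) * (math.cos(angle) if k % 2 == 0 else math.sin(angle))

def empirical_coefficients(y, K):
    """c(k, n) = n^{-1} sum_i y(t_i) phi_k(t_i) with t_i = i/n, for k = 1..K, as in (8)."""
    n = len(y)
    return [sum(y[i - 1] * phi(k, i / n) for i in range(1, n + 1)) / n
            for k in range(1, K + 1)]

def tau(c_hat, w, N):
    """tau(N, n) = sum_{k=N+1}^{2N} w(k)^2 c(k, n)^2, as in (9); c_hat[k-1] holds c(k, n)."""
    return sum((w(k) * c_hat[k - 1]) ** 2 for k in range(N + 1, 2 * N + 1))

def select_M(c_hat, w, N_plus):
    """M(n): a minimizer of tau(N, n) over N = 1..N_plus, as in (10).

    Requires len(c_hat) >= 2 * N_plus.
    """
    return min(range(1, N_plus + 1), key=lambda N: tau(c_hat, w, N))
```

With noise-free data `y = [phi(2, i / 64) for i in range(1, 65)]` the empirical coefficient `c(2, n)` is close to 1 and the others are close to 0, which reflects the discrete orthogonality of the system on the uniform grid.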



3 Modular spaces. 

Let again $X = [0,1]$, $t \in X$, and let $T = \{\phi_k(t)\}$ be the classical complete normalized trigonometric system. Define for a measurable function $f : [0,1] \to R$

$$f(t) = \sum_{k=1}^{\infty} c(k)\, \phi_k(t);$$

here

$$c(k) = \int_0^1 f(x)\, \phi_k(x)\, dx$$

are the Fourier coefficients of the function $f(\cdot)$ over the system $T$.

Let also $w = \{w(k)\}$ be a sequence of positive numbers (a weight). We introduce the following modular space $B(p,w) = B(T,p,w)$, which will be called the modular weight space, consisting of all the (measurable) functions $f$ with finite norm

$$\|f\|_{B(p,w)} = \|f\|_{p,w} := \left[ \sum_{k=1}^{\infty} w^p(k)\, |c(k)|^p \right]^{1/p}, \quad p \in (1, \infty). \tag{12}$$

For instance, the Sobolev spaces $W_2^m[0,1]$, $m = 0, 1, 2, \ldots$, consisting of all the periodic (mod 1) $m$ times differentiable functions $f$ with finite norm

$$\left[ \left( \|f\|_{L_2[0,1]} \right)^2 + \left( \|f^{(m)}\|_{L_2[0,1]} \right)^2 \right]^{1/2},$$

are weighted modular spaces relative to the system $T$.

Of course, the $B(w)$ space with $w(k) = 1$ coincides with the ordinary space $L(2)$ on the set $[0,1]$. We will write in this case

$$\|f\| = \|f\|_{L(2)} = \|f\|_{B(1)}.$$

A complete investigation of these spaces can be found in the book [17].

In the case when $p = 2$ we will write for brevity $B(2,w) = B(w)$. Of course, the space $B(w) = B(2,w)$ is a (separable) Hilbert space.
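As a small illustration of the norm (12), the following Python sketch evaluates it for a finite list of Fourier coefficients. The function name `modular_norm` and the truncation to finitely many coefficients are assumptions of the example; the infinite series in (12) is not evaluated.

```python
def modular_norm(c, w, p=2.0):
    """Truncated B(p, w) norm: (sum_k w(k)^p |c(k)|^p)^(1/p).

    c -- finite list of Fourier coefficients, c[k-1] holds c(k)
    w -- weight, given as a function of the index k = 1, 2, ...
    p -- exponent, assumed to lie in (1, infinity)
    """
    if p <= 1.0:
        raise ValueError("p must be greater than 1")
    total = sum((abs(w(k)) * abs(ck)) ** p for k, ck in enumerate(c, start=1))
    return total ** (1.0 / p)
```

With `w(k) = 1` and `p = 2` this reduces to the ordinary Euclidean norm of the coefficient list, in line with the identification of $B(1)$ with $L(2)$.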



The notion "$L_2(\Omega \times [0,1])$ sense" means by definition that we consider the following loss function:

$$V(h(n,\cdot), f) := \mathbf{E}\|h(n,\cdot) - f(\cdot)\|^2, \tag{13}$$

where $h(n,\cdot)$ is an arbitrary estimate of the function $f(\cdot)$ based on the observations $\{y(t_i)\}$, $i = 1, 2, \ldots, n$.

We suppose in addition that

$$f \in L(2) \iff \|f\|^2 = \sum_{k=1}^{\infty} c^2(k)\, w^2(k) < \infty. \tag{14a}$$

The condition (14a) will be used for the non-adaptive estimation of the solution $f$. For the construction of the adaptive estimate we need to assume the following condition (14b):

$$f \in B(w) \iff \|f\|^2_{B(w)} = \sum_{k=1}^{\infty} c^2(k)\, w^4(k) < \infty. \tag{14b}$$



4 Main result. Construction of our solution (estimation).

Let us consider the projection, or Tchentsov's, estimate of the function $g(\cdot)$ of the form

$$g(N,n,t) = \sum_{k=1}^{N} c(k,n)\, \phi_k(t), \quad N = \mathrm{const} \in [1, n/3],$$

and we construct correspondingly the following projection estimate of the function $f(\cdot)$:

$$f(N,n,t) := \sum_{k=1}^{N} w(k)\, c(k,n)\, \phi_k(t), \quad N = \mathrm{const} \in [1, n/3]. \tag{15}$$

We find by direct calculation, using the condition $f(\cdot) \in B(w)$, as $n \to \infty$:

$$V(f(N,n,\cdot), f(\cdot)) \sim A(N,n). \tag{16}$$



If we choose

$$N = N_0 := \mathop{\mathrm{argmin}}_{N \in [1, N^+]} A(N,n),$$

then we conclude that the optimal rate of convergence for the non-adaptive estimate $f(N_0, n, \cdot)$ is asymptotically $A^*(n)$: as $n \to \infty$

$$\mathbf{E}\|f(N_0, n, \cdot) - f(\cdot)\|^2 \sim A^*(n).$$
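The non-adaptive choice of $N_0$ can be sketched numerically. The Python fragment below evaluates $A(N,n) = \sigma^2 S(N)/n + \rho(N)$ and minimizes it over $N$; it assumes $S(N) = \sum_{k \le N} w^2(k)$ and $\rho(N) = \sum_{k > N} w^2(k) c^2(k)$, as in the decomposition (30), and truncates the infinite sum in $\rho(N)$ at a hypothetical cut-off `K_max`. The function names are illustrative only.

```python
def risk_bound(N, n, sigma, w, c, K_max):
    """A(N, n) = sigma^2 * S(N) / n + rho(N), with rho(N) truncated at K_max terms.

    w, c -- weight and true Fourier coefficients, given as functions of k = 1, 2, ...
    """
    S = sum(w(k) ** 2 for k in range(1, N + 1))
    rho = sum((w(k) * c(k)) ** 2 for k in range(N + 1, K_max + 1))
    return sigma ** 2 * S / n + rho

def oracle_N0(n, sigma, w, c, N_plus, K_max):
    """N_0: a minimizer of A(N, n) over N = 1..N_plus (requires the true c(k))."""
    return min(range(1, N_plus + 1),
               key=lambda N: risk_bound(N, n, sigma, w, c, K_max))
```

The sketch makes the limitation of the non-adaptive estimate visible: `oracle_N0` needs the true coefficients `c`, which are not available in practice.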

The same result, up to a multiplicative constant, is true for our adaptive estimate, which we build in the following way. Namely, let us define

$$\hat f(n,t) := f(M(n), n, t). \tag{17}$$



An asymptotically exact adaptive estimate of the solution $f(\cdot)$ may be obtained under the conditions (5b), (6b), (11a), and (14b) by using the so-called penalty function method. Indeed, we define

$$M_1(n) := \mathop{\mathrm{argmin}}_{N \in [1, N^+]} \tau_1(N,n), \tag{18}$$

$$\tau_1(N,n) := \tau(N,n) + (2 - \gamma(n) - \Gamma) \cdot \sigma^2(n) \cdot S(N)/n = \sum_{k=N+1}^{2N} c^2(k,n)\, w^2(k) + (2 - \gamma(n) - \Gamma) \cdot \sigma^2(n) \cdot S(N)/n, \tag{19}$$

$$\tau_1^*(n) := \min_N \tau_1(N,n). \tag{20}$$

The function $(2 - \gamma(n) - \Gamma) \cdot \sigma^2(n) \cdot S(N)/n$ is said to be the penalty function.

We introduce a new estimate $\tilde f(n,t)$ of the function $f(t)$ as follows:

$$\tilde f(n,t) := f(M_1(n), n, t). \tag{21}$$

We assert that under the conditions (5b), (6b), (11b) and (14b)

$$V(\tilde f(n,\cdot), f(\cdot)) \le A^*(n)(1 + \nu(n)), \tag{22}$$

where $\lim_{n \to \infty} \nu(n) = 0$.
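The penalized selection rule (18)-(19) can be sketched as follows. This is an illustration only: the function names are hypothetical, the estimates `sigma2` (for $\sigma^2(n)$) and `gamma_hat` (for $\gamma(n)$) and the constant `Gamma` are taken as given inputs, and the coefficient $(2 - \gamma(n) - \Gamma)$ is used exactly as it is printed in (19).

```python
def tau1(c_hat, w, N, n, sigma2, gamma_hat, Gamma):
    """tau_1(N, n) = tau(N, n) + (2 - gamma_hat - Gamma) * sigma2 * S(N) / n, as in (19).

    c_hat[k-1] holds c(k, n); w is the weight as a function of k;
    S(N) is taken to be sum_{k<=N} w(k)^2 (an assumption of this sketch).
    """
    t = sum((w(k) * c_hat[k - 1]) ** 2 for k in range(N + 1, 2 * N + 1))
    S = sum(w(k) ** 2 for k in range(1, N + 1))
    return t + (2.0 - gamma_hat - Gamma) * sigma2 * S / n

def select_M1(c_hat, w, N_plus, n, sigma2, gamma_hat, Gamma):
    """M_1(n): a minimizer of tau_1(N, n) over N = 1..N_plus, as in (18).

    Requires len(c_hat) >= 2 * N_plus.
    """
    return min(range(1, N_plus + 1),
               key=lambda N: tau1(c_hat, w, N, n, sigma2, gamma_hat, Gamma))
```

Compared with the unpenalized rule (10), only the additive term proportional to $S(N)/n$ changes, so the same empirical coefficients can be reused for both selections.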

Theorem 1. We suppose that all our conditions, in particular the conditions (5a), (6a), (11a), and (14a), are satisfied. Then

$$\overline{\lim}_{n \to \infty}\, V(\hat f(n,\cdot), f(\cdot))/A^*(n) \le 1, \tag{23a}$$

$$\overline{\lim}_{n \to \infty}\, \|\hat f(n,\cdot) - f(\cdot)\|^2/A^*(n) \le 1/U(\gamma_-, \Gamma_-) \quad (\mathrm{mod}\ \mathbf{P}). \tag{23b}$$

Remark 1. Note that the estimate $\hat f(n,\cdot)$ is optimal in the general case, namely, in the case when $U(\gamma_-, \Gamma_-) \to 1-$. See [12].

Theorem 2. Under the conditions (5b), (6b), (11a) and (14b) we have

$$V(\tilde f(n,\cdot), f(\cdot)) \le A^*(n)(1 + \nu_1(n)), \tag{24a}$$

$$\overline{\lim}_{n \to \infty}\, \|\tilde f(n,\cdot) - f(\cdot)\|^2/A^*(n) \le 1, \tag{24b}$$

where $\lim_{n \to \infty} \nu_1(n) = 0$.

The proof is the same as the proof of theorem 1; see below.



5 Energy estimation. 



Let us define the energy functional, or briefly the energy, $H = H(f)$ of the signal $f(\cdot)$ as usual:

$$H = H(f) := \|f\|^2_{L(2)} = \int_0^1 f^2(t)\, dt. \tag{25}$$

We estimate in this section the energy functional $H = H(f)$; we will prove that under natural conditions, in particular $f \in B(w)$, there exists an estimate $H(n,f)$ which converges to $H(f)$ at the rate $1/\sqrt{n}$ (optimality).

The necessity of the condition $f \in B(w)$ for the possibility of an order-optimal estimation of $H(f)$ is proved in [10], [14].

We now describe our energy estimate $H(n,f)$, which is based on our adaptive estimate $\hat f(n,\cdot)$ and differs from all other such estimates, see [9], [10], [14] etc.

Our estimate is a slight generalization of those offered therein.

We introduce the following energy estimation functional:

$$H(n,f) = \sum_{k=1}^{M} c^2(k,n)\, w^2(k) - \sigma^2(n)\, S(M)/n. \tag{26}$$

We will call the function $\sigma^2(n)\, S(M)/n$ the anti-penalty function.
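The estimate (26) is a plug-in sum with a bias correction, which can be sketched in Python as follows. The function name `energy_estimate` is hypothetical, `S(M)` is taken to be $\sum_{k \le M} w^2(k)$ (an assumption of the sketch), and the cut-off `M` and the variance estimate `sigma2` are treated as given inputs.

```python
def energy_estimate(c_hat, w, M, n, sigma2):
    """H(n, f) = sum_{k<=M} c(k, n)^2 w(k)^2 - sigma2 * S(M) / n, as in (26).

    c_hat[k-1] holds the empirical coefficient c(k, n); w is the weight as a
    function of k; the subtracted term is the anti-penalty function.
    """
    raw = sum((w(k) * c_hat[k - 1]) ** 2 for k in range(1, M + 1))
    S = sum(w(k) ** 2 for k in range(1, M + 1))
    return raw - sigma2 * S / n
```

The subtraction compensates for the noise contribution to each squared empirical coefficient, which has mean $\sigma^2/n$ according to the representation used in the proofs.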

Theorem 3. Let $f \in B(w)$. Assume, in addition to the conditions (5b), (6b), (11a), that

$$\lim_{n \to \infty} S_2(N_0(n))/n = 0, \qquad \lim_{n \to \infty} \rho(N_0(n))/\sqrt{n} = 0. \tag{27}$$

Then the statistic $H(n,f)$ is a consistent estimate of the energy value $H$, optimal up to a multiplicative constant:

$$\lim_{n \to \infty} n \cdot \mathbf{E}(H(n,f) - H(f))^2 = 4 \cdot \sigma^2 \cdot \|f\|^2_{B(w)}. \tag{28}$$

Notice that the optimality of the energy estimate $H(n,f)$ follows from the proposition (28), see [10].



6 Proofs. 

A. Investigation of the function estimations; theorems 1 and 2.

1. The proof is the same as in [21], chapter 5, section 13; see also [1], [18] etc., where the case $R(t) = \delta(t)$ is considered, i.e. the case of the classical regression problem in nonparametric statistics. Namely, it is proved therein that as $n \to \infty$ the empirical Fourier coefficients $c(k,n)$ are jointly asymptotically independent and have the normal distribution with the parameters

$$\mathrm{Law}(c(k,n)) = N(c(k), \sigma^2/n).$$

For the detailed proof of this assertion see, e.g., [2], [5], [9], [10], [18], [29].
Therefore, we can write the following representation:

$$c(k,n) = c(k) + \sigma \zeta(k)/\sqrt{n}, \tag{29}$$

where the random variables $\{\zeta(k)\}$ are jointly independent and have the standard normal distribution.

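2. We find by direct calculation, using the representation (29) for the projection estimates $f(N,n,t)$, as $N, n \to \infty$, $N \in [1, N^+]$:

$$\|f(N,n,\cdot) - f\|^2 \sim \sigma^2 n^{-1} \sum_{k=1}^{N} \zeta^2(k)\, w^2(k) + \rho(N) =$$

$$\rho(N) + \sigma^2 n^{-1} \sum_{k=1}^{N} w^2(k) + \sigma^2 n^{-1} \sum_{k=1}^{N} w^2(k)\left( \zeta^2(k) - 1 \right) =: A(N,n) + \beta(N,n). \tag{30}$$

It follows from the Law of Iterated Logarithm (LIL) that

$$\lim_{n \to \infty} \min_{N \le N^+} \beta(N,n)/A^*(n) = 0,$$

$$A(N,n) = \mathbf{E}\|f(N,n,\cdot) - f\|^2 \sim \rho(N) + \sigma^2 n^{-1} S(N),$$

therefore

$$\overline{\lim}_{n \to \infty}\, V(\hat f(n,\cdot), f(\cdot))/A^*(n) \le 1/(1 - \gamma_+).$$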
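3. As long as for $n \to \infty$

$$\mathbf{E}\tau(N,n) \ge \rho(N)(1 - \gamma_-) + n^{-1}(\Gamma_- - 1)\, S(N) \ge U(\gamma_-, \Gamma_-) \cdot A(N,n),$$

$$A^*(n) \le \mathbf{E}\tau^*(n)/U(\gamma_-, \Gamma_-),$$

we conclude

$$\overline{\lim}_{n \to \infty}\, A^*(n)/\tau^*(n) \le 1/U(\gamma_-, \Gamma_-).$$

This completes the proof of theorem 1. Theorem 2 is proved analogously.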
4. Further, we conclude analogously by means of the condition (5a):

$$\mathbf{E}[\tau(N,n)] \asymp \rho(N) + n^{-1} S(N) \asymp A(N,n). \tag{31}$$

If we take in (31) the value $N = N_0$, we conclude

$$\mathbf{E}\|f(N_0, n, \cdot) - f\|^2 \asymp A^*(n).$$

Hence the estimate $f(N_0, n, \cdot)$ is a consistent estimate of the function $f(\cdot)$ in the $L(2)$ sense.

It follows from the theorems of Tchentsov [27] and Ibragimov-Chasminsky [12] that the rate of convergence $\asymp A^*(n)$ as $n \to \infty$ is optimal in the general case. But this method of solution needs the value $\rho(N)$, or at least its order as $N \to \infty$. This solution $f(N_0, n, \cdot)$ is called non-adaptive.

A solution (estimate) which does not use any a priori information about the function $f(\cdot)$ being estimated will be called adaptive.

5. We compute as $N, n \to \infty$, $N \in [1, N^+]$:

$$\mathrm{Var}[\tau(N,n)] \asymp n^{\ldots}\left[ S_2(N) + n^{-1} \rho_2(N) \right].$$

We conclude by virtue of the adaptivity conditions that for some positive constant $C > 0$

$$\mathrm{Var}[\tau(N,n)] \le C\, n^{\ldots} \cdot A(N,n).$$

Further considerations are similar to [21], chapter 5, section 13; see also [1], [18].

Note that it is proved there that under the conditions (5b), (6b), with probability one and in the $L_2(\Omega)$ sense,

$$\lim_{n \to \infty} M_1(n)/N_0 = 1, \qquad \lim_{n \to \infty} \rho(M_1(n))/\rho(N_0) = 1. \tag{32}$$

As a consequence, the estimate $\tilde f(n,\cdot)$ is a consistent estimate of the solution $f$ in the $L(2)$ sense with probability one:

$$\|\tilde f(n,\cdot) - f(\cdot)\|^2 \to 0, \quad n \to \infty \quad (\mathrm{mod}\ \mathbf{P}).$$

The same result is true for the estimate $\hat f(n,\cdot)$.

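B. Investigation of the energy estimations; theorem 3.

1. We have for the energy estimate $H(n,f)$ (with $M = M(n)$):

$$H(n,f) \sim \sum_{k=1}^{M} w^2(k)\left[ c(k) + \sigma \zeta(k)/\sqrt{n} \right]^2 - \sigma^2 S(M)/n =$$

$$\sum_{k=1}^{M} c^2(k)\, w^2(k) + 2\sigma n^{-1/2} \sum_{k=1}^{M} w^2(k)\, c(k)\, \zeta(k) + \sigma^2 n^{-1} \sum_{k=1}^{M} w^2(k)\left( \zeta^2(k) - 1 \right) \sim$$

$$\sum_{k=1}^{N_0} c^2(k)\, w^2(k) + 2\sigma n^{-1/2} \sum_{k=1}^{N_0} w^2(k)\, c(k)\, \zeta(k) + \sigma^2 n^{-1} \sum_{k=1}^{N_0} w^2(k)\left( \zeta^2(k) - 1 \right) =$$

$$H(f) - \rho(N_0) + 2\sigma\, \eta_1(n) + \sigma^2 \eta_2(n), \tag{33}$$

where

$$\eta_1(n) = n^{-1/2} \sum_{k=1}^{N_0} w^2(k)\, c(k)\, \zeta(k),$$

$$\eta_2(n) = n^{-1} \sum_{k=1}^{M} w^2(k)\left( \zeta^2(k) - 1 \right).$$

It follows from the condition (27) that

$$\lim_{n \to \infty} n \cdot \mathbf{E}(H(n,f) - H(f))^2 = 4\sigma^2 \lim_{n \to \infty} \mathrm{Var}\left[ \sum_{k=1}^{N_0} w^2(k)\, c(k)\, \zeta(k) \right] =$$

$$4\sigma^2\, \mathrm{Var}\left[ \sum_{k=1}^{\infty} w^2(k)\, c(k)\, \zeta(k) \right] = 4\sigma^2 \|f\|^2_{B(w)} < \infty,$$

as long as $\|f\|_{B(w)} < \infty$.

This completes the proof of theorem 3.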



7 Confidence regions. 

A. Confidence interval (adaptive) for the function $f$ in the $L(2)$ norm.

For the rough construction of the confidence domain, also adaptive, in the $B(w)$ norm we prove the following result.

Theorem 4. Under the same conditions as in theorem 1 we assert that

$$\overline{\lim}_{n \to \infty}\, \tau^*(n)/A^*(n) \le 1/U(\gamma_-, \Gamma_-) \tag{34}$$

and correspondingly

$$\overline{\lim}_{n \to \infty}\, \|\hat f(n,\cdot) - f(\cdot)\|^2/\tau^*(n) \le 1/U(\gamma_-, \Gamma_-). \tag{35}$$

Therefore, we conclude: with probability tending to one as $n \to \infty$

$$\|\hat f(n,\cdot) - f(\cdot)\|^2 \le 1.05 \cdot \tau^*(n)/(1 - \gamma_+). \tag{36}$$

But in the general case the value $\gamma_+$ is unknown; in order to construct an estimable confidence region, we need to suppose the stricter conditions (5b), (6b), (11b) and (14b).

Theorem 5. We assert under the conditions (5b), (6b), (11a) and (14b) that the variables

$$\Delta(n) := \|\tilde f(n,\cdot) - f(\cdot)\|^2$$

may be represented as follows:

$$\Delta(n) = \tau^*(n) + \sigma(n) \sqrt{2 M_1(n)}/n \times \xi(n), \tag{37}$$

where the sequence of random variables $\xi(n)$ has asymptotically the standard normal (Gaussian) $N(0,1)$ distribution:

$$\lim_{n \to \infty} \mathrm{Law}(\xi(n)) = N(0,1). \tag{38a}$$

The proof follows from the decomposition (30) and the Central Limit Theorem (CLT) for the sum

$$\xi(n) = \left[ \sigma(n) \sqrt{2 M_1(n)}/n \right]^{-1} \sum_{k=1}^{M_1(n)} \left( \zeta^2(k) - 1 \right).$$

Notice that

$$\lim_{n \to \infty} \left[ \sigma(n) \sqrt{2 M_1(n)}/n \right]/A^*(n) = 0 \quad (\mathrm{mod}\ \mathbf{P}).$$

A non-asymptotic confidence interval for $f(\cdot)$ may be constructed as in [1] in the case of the classical regression problem $R(t) = \delta(t)$ under the condition (11b). Namely, let us denote

$$r = r(q) = \min(q/2, 2);$$

then

$$\sup_{n \ge 16} \max\left[ \mathbf{P}(\xi(n) > Qu),\ \mathbf{P}(\xi(n) < -Qu) \right] \le \exp\left( -C u^r \right). \tag{38b}$$

B. Adaptive confidence interval for the energy.

The next result may be proved analogously, by means of the decomposition (33) instead of the decomposition (30), which was used in the proof of theorem 5.

Theorem 6. We assert under the conditions (5b), (6b), (11a), (14b) and the condition $f \in B(w)$ that the estimate $H(n,f)$ has asymptotically the normal distribution with parameters

$$\mathrm{Law}(H(n,f)) \sim N\left( H(f),\ 4\sigma^2 \|f\|^2_{B(w)}/n \right). \tag{39}$$

Remark 2. The value $\|g\|^2_{B(w^2)} = \|f\|^2_{B(w)}$ may be estimated like the value $V = \|f\|^2$.

Remark 3. In the case when $w(k) = 1$, i.e. when $g(t) = f(t)$ or equally $R(t) = \delta(t)$, we have $\|g\|^2_{B(w)} = \|f\|^2 = H(f)$; hence

$$\mathrm{Law}(H(n,f)) \sim N\left( H(f),\ 4\sigma^2 H(f)/n \right). \tag{40}$$

Using Fisher's transform, we conclude that the variable

$$\sqrt{n}\left( \sqrt{H(n,f)} - \sqrt{H(f)} \right)/\sigma(n) \tag{41}$$

has asymptotically the standard Gaussian distribution.
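In the case $w(k) = 1$ the relation (41) can be turned into an approximate confidence interval for the energy by inverting the square-root transform. The Python sketch below is a minimal illustration under stated assumptions: the function name `energy_confidence_interval` is hypothetical, `sigma` stands for the known value $\sigma$ or its estimate $\sigma(n)$, a two-sided standard normal quantile is used, and negative values are clipped at zero since the energy is non-negative.

```python
import math
from statistics import NormalDist

def energy_confidence_interval(H_hat, sigma, n, level=0.95):
    """Approximate confidence interval for H(f), based on the asymptotic
    normality of sqrt(n) * (sqrt(H_hat) - sqrt(H)) / sigma.

    H_hat -- energy estimate H(n, f)
    sigma -- noise standard deviation (known or estimated)
    n     -- number of observations
    level -- nominal coverage, e.g. 0.95
    """
    z = NormalDist().inv_cdf(0.5 + level / 2.0)  # two-sided normal quantile
    root = math.sqrt(max(H_hat, 0.0))            # sqrt of the (clipped) estimate
    half = z * sigma / math.sqrt(n)              # half-width on the sqrt scale
    lower = max(root - half, 0.0) ** 2
    upper = (root + half) ** 2
    return lower, upper
```

On the square-root scale the half-width $z\sigma/\sqrt{n}$ does not depend on the unknown energy, which is the purpose of the variance-stabilizing transform.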



8 Concluding remarks. 

1. The optimal consistent estimate $g(n,t)$ in the $L(2)$ sense of the function $g(\cdot)$ ("Regression problem") offered in [1], [18], [21] has the form

$$g(n,t) = \sum_{k=1}^{N_1(n)} c(k,n)\, \phi_k(t),$$

where

$$N_1(n) = \mathop{\mathrm{argmin}}_{N \in [1, \mathrm{Ent}(0.5 n)]} \sum_{k=N+1}^{2N} c^2(k,n).$$

But the estimate of the function $f(\cdot)$ of the form

$$f_1(n,t) := \sum_{k=1}^{N_1(n)} c(k,n)\, w(k)\, \phi_k(t),$$

based on the estimate $g(n,t)$, is not optimal when $\lim_{k \to \infty} |w(k)| = \infty$.

2. Adaptive estimation in general modular spaces.

In order to construct an adaptive estimate of the function $f$, optimal in order as $n \to \infty$ in the $B(p,w)$, $p \in (1, \infty)$, norm, we introduce the following function:

$$f^{(p)}(n,x) := \sum_{k=1}^{M^{(p)}(n)} w(k)\, c(k,n)\, \phi_k(x),$$

where

$$M^{(p)}(n) := \mathop{\mathrm{argmin}}_{N \in [1, (N^+)^{1/p}]} \sum_{k=N+1}^{2N} w^p(k)\, |c(k,n)|^p.$$

A more detailed investigation will be published in another article.

3. Multidimensional case.

We consider in this subsection the following multidimensional generalization of our problem. Let $Z(n)$, $n = 16, 17, \ldots$ be a sequence of vector-valued sets (experimental designs) in the cube $[0,1]^d$, $d = 2, 3, \ldots$:

$$Z(n) = \{x_i = t_i = t_i(n)\}, \quad t_i \in [0,1]^d.$$

At the points $t_i$ we observe the unknown signal (process, field) $f = f(t)$, $t \in [0,1]^d$, against background noise:

$$y(t_i) = R * f(t_i) + \sigma \epsilon_i,$$

where the noise sequence $\{\sigma \epsilon_i\}$ is the sequence of errors of measurement: a sequence of independent (or weakly dependent), centered ($\mathbf{E}\epsilon_i = 0$), normalized ($\mathrm{Var}(\epsilon_i) = 1$) random variables, and $\sigma = \mathrm{const} > 0$ is the standard deviation of the errors.

For the investigation of this problem in the case $p = 2$ and $R(t) = \delta(t)$ see [22]; we only emphasize here the need for optimal planning of the experiment, in other words, experimental design.

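4. Example.

Let again

$$|c(k)| \sim C_1 k^{-\Delta}, \quad |w(k)| \sim C_2 k^{\theta}, \quad \Delta, \theta = \mathrm{const} > 0.$$

If

$$\Delta > 2\theta + 1,$$

then as $n \to \infty$

and

$$\mathbf{E}|V(n,f) - V(f)|^2 \sim C(\Delta, \theta)\, n^{-1}.$$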
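For orientation, the order of the quantities $S(N)$, $\rho(N)$ and $A^*(n)$ in this example can be worked out as follows. This is only a sketch under stated assumptions: it takes $S(N) = \sum_{k \le N} w^2(k)$ and $\rho(N) = \sum_{k > N} w^2(k) c^2(k)$ as in the decomposition (30), ignores all multiplicative constants, and assumes that the balancing value of $N$ does not exceed $N^+$.

```latex
% Orders of the two terms of A(N,n) for |c(k)| ~ C_1 k^{-\Delta}, |w(k)| ~ C_2 k^{\theta}
S(N) = \sum_{k=1}^{N} w^2(k) \asymp N^{2\theta + 1},
\qquad
\rho(N) = \sum_{k > N} w^2(k)\, c^2(k) \asymp N^{-(2\Delta - 2\theta - 1)}
\quad (2\Delta - 2\theta - 1 > 0),

% Balancing the two terms of A(N,n) = \sigma^2 S(N)/n + \rho(N):
N^{2\theta + 1}/n \asymp N^{-(2\Delta - 2\theta - 1)}
\;\Longrightarrow\;
N_0 \asymp n^{1/(2\Delta)},
\qquad
A^*(n) \asymp n^{-(2\Delta - 2\theta - 1)/(2\Delta)}.
```

The condition $2\Delta - 2\theta - 1 > 0$ used for the tail sum holds under the assumption $\Delta > 2\theta + 1$ of the example.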